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Abstract— A  high-resolution  raster-graphics  display  is  usually  combined  with  processing  power 
and  a  memory  organization  that  facilitates  basic  graphics  operations.  For  many  applications, 
including  interactive  text  processing,  the  ability  to  quickly  move  or  copy  small  rectangles  of 
pixels  is  essential.  This  paper  proposes  a  novel  organization  of  raster-graphics  memory  that 

permits  all  small  rectangles  to  be  moved  efficiently.  The  memory  organization  is  based  on  a 
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doubly  periodic  assignment  of  pixels  to  M  memory  chips  according  to  a  Fibonaccr  lattice.  The 
memory  organization  guarantees  that  if  a  rectilinearly  oriented  rectangle  contains  fewer  than 
M/m6)  pixels,  then  all  pixels  will  reside  in  different  memory  chips,  and  thus  can  be  accessed 
simultaneously.  Jf 

j.  —We)  also  define  a  continuous  analogue  of  the  problem  which  can  be  posed  as,  "HTiaf  is  the 
maxiwium  density  of  a  set  of  points  in  the  plane  such  that  no  two  points  are  contained  in  the 
interior  of  a  rectilinearly  oriented  rectangle  of  unit  area.  ”vk  show  the  existence  of  such  a  set 
with  density  1/^,  and  prove  this  is  optimal  by  giving  a  matching  upper  bound. 
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1.  Introduction 

The  length  of  one  memory  cycle  is  a  bound  on  how  quickly  a  single  pixel  (picture  element)  of 
a  raster-graphics  display  can  be  updated.  If  each  pixel  in  a  region  must  be  updated  individually, 
the  length  of  a  memory  cycle  will  produce  a  bound  which  is  unacceptable  for  many  real-time  or 
interactive  environments.  A  natural  way  to  avoid  this  bound  is  to  access  more  than  a  single  pixel 
at  a  time.  Since  the  memory  is  typically  partitioned  among  M  random- access  memory  chips,  as 
many  as  M  pixels  can  be  accessed  simultaneously,  provided  that  no  two  pixels  reside  in  the  same 
memory  chip. 

Figure  1  illustrates  a  common  organization  of  raster- graphics  memory.  Each  pixel  on  the 
screen  is  assigned  to  one  of  M  memory  chips  in  row-major  order.  Thus  in  every  row,  the  pixels 
in  column  m,  M-j-m,  2 M  +  »n,  and  so  forth  are  stored  in  the  the  same  memory  chip  m. 
This  organization  made  a  good  deal  of  sense  when  raster-graphic  displays  were  new  and  the 
interface  between  the  raster  memory  and  the  CRT  was  considered  complicated.  When  the  screen 
is  refreshed  from  memory,  the  line-by-line  horizontal  scan  accesses  M  pixels  in  a  row  and  converts 
them  into  an  analog  video  signal.  But  although  the  memory  system  achieves  maximal  parallelism 
for  the  screen  refresh  operation,  it  can  be  remarkably  inefficient  for  other  operations.  Updating 
a  vertical  line  of  pixels,  for  example,  requires  a  separate  memory  access  for  each  pixel. 


1234.  .  .  M  1  2  3  4  .  .  .  M  1  2  3  4  .  .  .  M 
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1234.  .  .  M  1  2  3  4  .  .  .  M  1  2  3  4  .  .  .M 
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Figure  1.  A  common  organization  for  raster-graphics  memory  which  is  efficient  for  raster 
scan^but  inefficient  for  vertical  updates. 

For  arbitrary  patterns  of  access  there  is  no  hope  of  maximal  parallelism  since  whatever  the 
organization,  an  adversary  can  choose  to  access  all  the  bits  in  a  single  memory  chip.  The  best  we 
can  hope  for  is  to  achieve  high  concurrency  for  a  limited  set  of  frequently  used  operations.  And 
today,  since  hardware  support  for  screen  refresh  is  relatively  well-understood,  attention  focuses 
on  those  operations  which  make  the  graphics  system  easier  to  program. 

Most  raster-graphics  applications  rely  on  the  copying  or  moving  of  a  rectangle  of  pixels  as 
a  basic  operation,  which  is  demonstrated  by  the  fact  that  this  operation  is  implemented  in  the 
microcode  of  most  graphics  processors.  The  ability  to  move  small  rectangles  quickly  is  especially 
important  in  text-oriented  applications. 

Recently,  a  display  was  developed  at  Carncgic-Mcllon  University  [3,6]  that  is  designed  to  move 
small  squares  quici.ly.  Figure  2  shows  how  pixels  arc  assigned  tc  memory  chips  in  the  case  of 
M  =  16  memory  chips.  The  screen  is  tiled  with  VAf-by-v/M  squares,  each  of  which  contains 
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Figure  2.  The  4-by-4  organization  for  raster-graphics  memory.  Every  4-by-4  square 
contains  pixels  from  distinct  memory  chips. 


a  pixel  assigned  to  a  different  memory.  The  attraction  of  this  scheme  is  that  any  y/Jd-by-y/M 
rcctilinearly  oriented  square,  whether  aligned  on  tile  boundaries  or  not,  contains  pixels  assigned 
to  different  memories.  Thus  any  square  of  area  M  can  be  accessed  in  one  memory  cycle. 

Unfortunately,  the  efficiency  of  the  raster-scan  operation  is  reduced  in  this  scheme  compared 
with  the  one  of  Figure  1.  The  line-by-line  scan  will  only  be  able  to  access  y/M  pixels  in  parallel 
because  every  y/M  +  1-by-one  horizontal  rectangle  contains  two  pixels  in  the  same  memory  chip. 
A  possible  solution  to  this  problem  is  to  stagger  the  tiles  so  that  the  second  column  of  tiles  is 
shifted  vertically  by  one  raster,  the  third  by  two  rasters,  and  so  on.  This  ad  hoc  solution  allows 
simultaneous  access  of  all  pixels  in  any  M-by-one  rectangle  as  well  as  simultaneous  access  of 
all  pixels  in  any  y/M- by-y/M  square,  but  it  suffers  from  asymmetry  of  horizontal  and  vertical 
dimensions  and  introduces  a  variety  of  other  complications. 

This  paper  asks  the  question,  "How  many  memory  chips  M  are  required  to  guarantee  that  all 
pixels  can  be  accesses  simultaneously  in  an  arbitrary  rectillinearly  oriented  rectangle  of  N  pixels  f” 
A  naive  organization  requires  M  =  N2  memory  chips,  but  we  can  do  much  better. 

This  paper  uses  techniques  from  number  theory  to  produce  novel  memory  organization  of  M 
y/5N  chips  that  allows  all  pixels  in  any  rectangle  of  area  N  to  be  simultaneously  accessed.  The 
scheme  is  regular — a  doubly  periodic  function  in  the  plane — and  the  constant  y/5  is  approached 
from  below,  so  that  for  small  values  of  N,  the  constant  is  less  than  two.  Furthermore,  for  the 
frequently-used  operation  of  accessing  a  horizontal  line,  our  scheme  allows  simultaneous  access 
of  all  M  memory  chips. 

The  remainder  of  this  paper  is  organized  as  follows.  Section  2  discusses  a  continuous  model 
of  the  problem  that  prompted  our  (discrete)  solution.  Section  3  presents  the  doubly  periodic 
“Fibonacci”  organization  of  graphics  memory,  and  Section  4  provides  the  number  theoretic 
analysis  necessary  to  prove  that  the  scheme  works.  The  optimality  of  the  Fibonacci  organization 
is  proved  in  Section  5,  and  Section  6  discusses  the  addressing  mechanisms  needed  to  make  the 
scheme  work  in  practice.  Section  7  contains  some  concluding  remarks. 

2.  A  continuous  analogue 

In  this  section  we  introduce  a  continuous  analogue  to  the  discrete  problem.  We  define  a  set  of 
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compatible  points  as  a  set  of  points  in  the  plane  such  that  no  two  points  in  the  set  are  contained  in 
the  interior  of  a  rcctilincarly  oriented  rectangle  of  unit  area.  The  question  we  ask  in  this  section 
is,  “ What  is  the  maximum  density  of  a  set  of  compatible  points?”  We  construct  a  set  of  compatible 
points  whose  density  is  1  /\/5,  and  we  prove  that  this  density  is  maximal. 

The  correspondence  between  this  problem  and  the  discrete  problem  introduced  in  the  previous 
section  is  as  follows.  First,  the  continuous  problem  deals  with  rectangles  of  unit  area.  A  closer 
correspondence  to  the  discrete  problem  uses  rectangles  of  area  N.  The  set  of  compatible  points 
then  corresponds,  in  the  discrete  problem,  to  the  set  of  pixels  which  reside  in  the  same  memory 
chip,  and  the  density  of  points  corresponds  to  the  reciprocal  of  the  number  M  of  memory  chips. 
The  principal  difference  in  formulation  is  that  in  the  continuous  model,  we  no  longer  require  that 
the  “pixels”  fall  on  grid  points. 

The  statement  in  the  continuous  problem  that  the  rectangles  have  unit  area  instead  of  area 
N,  however,  results  in  no  loss  of  generality.  Any  set  of  points  such  that  no  two  are  contained  in  a 
rectangle  of  area  N  can  be  mapped  to  a  set  of  compatible  points  by  shrinking  the  coordinates  of 
each  point  by  a  factor  of  \/N.  Observe,  however,  that  this  linear  transformation  does  not  work 
for  the  discrete  case  where  all  points  must  have  integer  coordinates. 

We  shall  find  it  convenient  to  adopt  some  standard  terminology  from  geometry  of  numbers. 
A  lattice  is  a  set  of  points  that  can  be  expressed  as  an  integral,  linear  combination  of  linearly 
independent  (over  R)  basis  vectors.  If  there  are  only  two  basis  vectors,  we  define  the  parallelogram 
with  the  two  basis  vectors  as  sides  the  basic  region  of  the  lattice.  The  fundamental  lattice  is  the 
lattice  generated  by  the  basis  vectors  (0, 1)  and  (1,0),  and  we  call  its  points  grid  points.  Many 
properties  of  lattices  can  be  found  in  [4]. 

We  formally  define  a  set  S  of  points  in  Ra  as  being  a  set  of  compatible  points  if  for  any  pair 
of  points  (xi.yi)  and  (x2,y2)  in  the  set,  we  have 

|(xi  —  x2)(y,  —  y2)|  >  1 . 

Around  every  point  P  drawn  from  a  set  5  of  compatible  points,  there  is  an  infinite-area  forbidden 
region  bounded  by  two  hyperbolae  inside  which  no  other  point  of  S  may  lie.  Figure  3  shows  the 
forbidden  region  for  a  point  at  the  origin.  The  points  in  that  forbidden  region  satisfy  |xy|  <  1. 

In  the  discrete  model,  the  problem  is  to  minimize  the  number  M  of  memory  chips  required 
to  allow  simultaneous  access  of  any  rectangle  of  N  pixels.  For  an  arbitrary  scheme  of  assigning 
pixels  to  memory  chips,  in  a  square  region  of  A  pixels,  there  will  be  some  memory  chip  with  the 
largest  number  k  of  pixels  in  the  area.  Therefore,  the  number  of  memory  chips  M  is  at  least 
A/k,  or  1/d  where  d  is  the  maximum  density  of  pixels  from  a  single  memory  chip  in  the  square 
region. 

The  analogue  to  minimizing  the  number  of  memory  chips  is,  in  the  continuous  model,  to 
maximize  the  density  of  points  in  a  set  of  compatible  points.  Formally,  we  define  the  density  of 
an  arbitrary  set  of  points  5  as 

<f(S)  =  lim  sup  IMtiipMi! , 

r  — •  oo  VT 

where  P(r)  is  a  disk  centered  at  the  origin  with  radius  r. 
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Figure  3.  The  forbidden  region  around  a  point  at  the  origin.  If  the  origin  it  in  a  compatible 
set,  then  all  the  other  points  in  the  compatible  set  must  fall  outside  the  region  defined  by 
the  hyperbolae. 


We  shall  construct  a  set  5  of  compatible  points  in  the  plane  whose  density  is  l/>/5,  and 
then  demonstrate  the  optimality  of  the  construction  by  proving  a  matching  upper  bound  on  the 
density  of  compatible  sets. 

Theorem  1.  The  lattice  that  is  generated  by  the  basis  vectors  (y/TJ$,y/#)  and  (—y/d>,\/l/<t>) 
form  a  compatible  set  whose  density  is  l/v^5,  where  <t>  —  £(1  +  y/5)  is  the  golden  ratio. 

Proof.  For  simplicity,  denote  (>/l /<t>,y/<p)  by  (a,b).  The  lattice  points  are  compatible  if  and  only 
if  for  all  integers  u  and  v,  the  lattice  point  v(a,  b)  +  u(—b,  a)  =  (av  —  bu,  bv  -f  au)  is  outside  the 
forbidden  region  around  the  origin  (since  the  lattice  is  invariant  under  translations  by  its  basis 
vectors).  Equivalently,  for  all  pairs  (u,  v)  =£  (0, 0),  we  must  have 

|(av  —  bu)(bv  +  au)|  >  1 . 

We  can  rewrite  the  product  as 

(av  —  bu)(bv  -f  au)  =  abv 2  -f  (a2  —  b2)uv  —  abu 2 

=  v2  —  (<b  —  1/ ^)uv  —  u2 

2  2 
=  V  —  uv  —  u  . 

Since  the  Diophantine  equation  v2  —  uv  —  it2  =  0  has  no  solution  except  u  =  v  =  0,  it  follows 
that 

|(av  —  6u)(6v  +  ow)!  =  |u2  —  uv  —  u2|  >  1 , 

and  thus  the  lattice  points  arc  indeed  compatible. 

The  area  of  the  basic  region  of  the  lattice  is  a2  +  62  =  <p  -f  l/<i,  which  is  \/5.  Since  there  is 
a  one-to-one  correspondence  between  lattice  points  and  lattice  squares,  the  density  is  l/y/b.§ 
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This  lattice  is  not  the  only  one  that  achieves  a  density  of  1  / v/5.  In  fact,  Tom  Leighton  has 
observed  that  there  are  an  infinite  number  of  lattices  of  compatible  points  that  have  this  density. 
For  any  t  the  lattice  generated  by  the  basis  vectors 


also  achieves  the  bound.  The  lattice  of  Theorem  1  is  a  member  of  this  family  of  lattices  (choose 
t  =  yjl /<t>),  although  the  basis  vectors  given  in  the  theorem  arc  different.  The  advantage  of  the 
basis  vectors  defined  in  the  theorem  is  that  they  define  a  basic  region  which  is  square,  which,  as 
we  shall  see  in  Section  4,  simplifies  somewhat  the  analysis  of  the  discrete  solution. 

3.  A  Fibonacci  lattice  organisation  of  raster-graphics  memory 

This  section  describes  an  organization  of  raster-graphics  memory  which  is  based  on  an  integer 
approximation  of  the  lattice  scheme  from  Theorem  1.  This  organization  has  the  property  that  all 
pixels  in  a  rcctilinearly  oriented  rectangle  can  be  accessed  simultaneously  as  long  as  the  rectangle 
contains  no  more  than  N  pixels.  The  number  M  of  memory  chips  required  is  at  most  y/5 N,  but 
for  many  practical  values  it  is  less  than  2 N. 

The  real-world  problem  differs  from  the  continuous  analogue  given  in  Section  2  in  that  the 
locations  of  pixels  must  have  integer  coordinates.  This  subtle  constraint  causes  the  problem 
to  change  in  two  ways.  First,  the  actual  bounds  are  better  for  the  discrete  case  than  for  the 
continuous  case,  although  asymptotically  they  are  the  same.  Second,  the  proofs  become  more 
involved. 

Not  surprisingly,  the  raster-graphics  organization  is  similar  to  the  scheme  in  Theorem  1,  which 
suggests  two  basis  vectors  be  used  to  generate  the  locations  of  all  pixels  within  the  same  chip  of 
the  raster-graphics  memory.  Pixels  are  assigned  to  chips  as  follows.  Let  a  and  b  be  two  relatively 
prime,  nonnegative  integers  which  will  be  specified  precisely  later.  The  two  orthogonal  vectors 
(a,  b)  and  (—6,  a)  determine  a  lattice  in  the  plane,  consisting  of  all  points  of  the  form 

v(a,6)  +  u(-6,a), 

where  u  and  v  are  integers.  Except  for  the  corners,  no  other  grid  point  lies  on  the  boundary  of 
the  basic  region  because  a  and  b  arc  relatively  prime.  By  including  exactly  one  of  the  four  corner 
points  in  the  basic  region,  the  region  can  be  used  to  tile  the  entire  plane.  Thus  the  number  of 
grid  points  in  the  basic  region  equals  its  area  o2  -f-  b2.  Each  of  the  grid  points  in  the  basic  region 
is  mapped  into  one  of  M  =  a2  -+-  62  distinct  memory  chips.  The  grid  points  in  the  plane  are 
partitioned  into  M  equivalence  classes.  Each  equivalence  class  corresponds  to  a  translation  of 
the  lattice.  All  points  in  the  same  equivalence  class  are  assigned  to  the  same  memory  chip.  Since 
each  equivalence  class  has  a  unique  representative  in  the  basic  region,  A/  memory  chips  are  used. 

In  the  next  section,  we  will  show  that  the  choice  of  successive  Fibonacci  numbers  a  =  Fr  and 
b  =  Fr . j_i,  which  yields  the  number  of  memories  M  =  Fir+i,  guarantees  that  every  rcctilinearly 
oriented  rectangh  containing  no  more  than  M  jV 5  pixels  can  be  accessed  simultaneously.  Figure 
4  illustrates  this  “Fibonacci  lattice’’  organization  for  thirteen  memory  chips  (a  =  2,  b  =  3). 
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Figure  4.  The  Fibonacci  lattice  organization  for  M  =  L3  memory  chips.  Every  rec* 
tilinearly  oriented  rectangle  having  no  more  than  N  =  U  pixels  has  the  property  that  all 
pixels  arc  from  distinct  memory  chips. 


Here,  the  situation  is  even  better  than  we  promised — any  rectangle  with  at  most  eleven  pixels 
contains  no  two  pixels  from  the  same  memory  chip. 

Furthermore,  observe  in  the  figure  that  one- by- 13  and  13-by-one  rectangles  have  no  conflicts, 
which  is  the  best  one  can  do  with  M  =  13  memory  chips.  This  circumstance  is  not  mere  luck. 
Lemma  2:  Let  a  and  b  be  relatively  prime  integers,  and  let  M  =  a2  -f  62.  The  doubly 
periodic  memory  organization  with  M  memory  chips  that  ts  based  on  the  lattice  generated  by 
basis  vectors  (a,b)  and  (— b,a)  has  the  property  that  any  one-by-M  or  M-by-onc  rectilinearly 
oriented  rectangle  contains  no  two  pixels  from  the  same  chip. 

Proof.  Since  the  organization  is  doubly  periodic,  we  can  consider  a  horizontal  or  vertical  line  that 
starts  at  the  origin  and  determine  the  next  lattice  point  that  falls  on  the  line.  If  the  line  is  vertical, 
all  pixels  on  it  have  i-coordinate  zero.  The  general  form  of  lattice  points  is  v(a,6)-f-  u(— b,  a)  = 
(aw  —  bu,  bv  +  au),  and  thus  all  lattice  points  on  the  line  will  have  av  —  6u  =  0.  It  follows 
that  a  divides  bu,  but  since  a  and  b  arc  relatively  prime,  we  can  conclude  that  a  divides  u,  and 
similarly,  b  divides  v.  Furthermore,  u  and  v  necessarily  have  the  same  sign,  which  means  that  the 
magnitude  \bv  -+■  au|  of  the  v-coordinate  is  |bu|  -j-  |au(.  Since  a  divides  u,  we  have  |u|  >  a,  and 
by  the  same  reasoning,  |v|  >  b.  Therefore,  ]6v|  -f  |au|  >  b2  +  a2  =  M,  and  the  magnitude  of 
any  lattice  point  on  the  vertical  line  is  at  least  M.  Thus  any  one-by-M  rectangle  cannot  contain 
two  pixels  from  the  same  chip.  Horizontal  lines  are  treated  the  same  way.| 

The  following  table  describes  the  actual  values  we  get  for  M  and  N  in  the  Fibonacci  lattice 
organization. 

M  5  13  34  89  233  610 

N  5  11  23  53  125  307 

Notice  that  for  all  the;  values,  the  size  N  of  rectangles  that  are  guaranteed  to  have  no 
conflicts  is,  in  fa  ,  large'  nan  M/2.  Thus  for  practical  values  of  M,  the  overhead  in  allowing 
fast  access  to  arbu.  "  '.y  shaped  rectangles  of  pixels  is  small. 
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4.  Mathematical  analysis 

In  this  section,  we  analyze  the  properties  of  the  Fibonacci  lattice  organization  described  in 
Section  3.  The  basis  vectors  for  the  raster-graphics  memory  organization  are  (Fr,Fr+l)  and 
(— Ff+i,/*Y),  where  Fr  is  the  rth  Fibonacci  number.  (When  we  do  not  rely  on  the  Fibonacci 
properties  basis-vector  components,  we  shall  denote  the  basis  vectors  by  (a,  6)  and  (—6,  a).)  We 
show  in  this  section  that  the  number  of  memory  chips  M  in  the  organization  is  approximately 
\/5  times  the  size  N  of  the  maximum  size  rectangle  guaranteed  to  have  no  conflicts. 

The  approach  in  this  section  is  to  find  the  minimum  size  MIN  of  a  rectilinearly  oriented 
rectangle  containing  two  distinct  lattice  points.  The  size  of  a  rectangle  is  defined  to  be  the 
number  of  pixels  in  the  rectangle.  Notice  that  this  definition  of  size  differs  from  the  continuous 
model  since  the  number  of  pixels  in  a  rectangle  determined  by  two  grid  points  equals  its  area 
plus  half  its  perimeter  plus  one.  Since  MIN  is  the  minimum  size  of  any  rectilinearly  oriented 
rectangle  containing  two  lattice  points,  N  —  MIN  —  1  because  no  rectangle  of  size  strictly  less 
than  MIN  contains  two  lattice  points. 

To  find  MIN ,  notice  that  since  the  lattice  is  invariant  under  translations  by  its  basis  vectors, 
we  lose  no  generality  if,  instead  of  discussing  all  pairs  of  lattice  points,  we  restrict  ourselves  to 
those  pairs  one  of  whose  elements  is  the  origin.  Furthermore,  since  we  are  interested  in  the 
minimal  size,  it  suffices  to  consider  only  those  rectangles  that  have  the  two  lattice  points  at 
opposite  corners.  The  first  lattice  point  is  the  origin  and  the  second  lattice  point  has  the  form 
v(a,b)  •+•  u(—b,a),  and  hence  the  size  of  the  rectangle,  its  area  plus  half  its  perimeter  plus  one, 
is 

S[u,  v)  =  (|au  +  6v|  +  1)(|— bu  -f  aw|  +  1) . 

The  value  MIN  is  the  minimum  of  S(u,  v)  over  all  integers  «  and  v  not  both  0.  In  order  to  find 
MIN,  we  first  translate  S(u,  v)  into  a  simpler  form. 

Lemma  3.  Let 

S(u,  v)  =  (|Fru  +  Fr+1v|  +  l)(|-Fr+lu  +  Fru|  -|-  1) , 
and  let  S{u,  v)  =  (|F2ru  —  F2r+j  v|  -f  l)(|u|  -f  1).  Then 

MIN  *=  min  5(u,  v) 

(*,»)*«>, 0) 

=  min  §(u,  v) . 

(«,v)^(0,0) 

Proof.  We  shall  show  that  the  range  of  S  is  the  same  as  the  range  of  S  by  using  an  intermediate 
form  Q.  For  simplicity,  we  shall  use  the  notation  a  =  Fr  and  b  =  Fr+i  introduced  above. 

Define  the  intermediate  form 

B(u,  v)  —  S(ku  —  bv,  —lu  -f  av) , 

where  k  and  l  are  integers  such  that  ak  —  bl  —  1.  (The  integers  k  and  l  exist  because  the  greatest 
common  divisor  of  a  and  6  is  one.)  The  linear  transformation  given  by 


CM-*,  ~X 


.- r.-v;:-.  mm 


is  a  bijectinn  since  the  determinant  of  the  matrix  is  one.  Thus  as  (u,v)  ranges  over  Z2,  the 
°ra»  red  pair  (ku  —  bv,  —lu  +  av)  also  takes  on  all  values  in  Z2,  and  hence  the  range  of  S  is  the 
same  as  the  range  of  B.  Since  the  linear  transformation  is  a  bijection  which  maps  (0,0)  to  (0,0), 
we  have 

min  S(u,v)  =  min  B(u,v). 

(t».u)^(0,0)  (u.iOtMO.O) 

If  we  expand  B(u,  v),  we  get 

B{u,  v )  =  S(ku  —  bv,  — lu  -f-  av) 

=  (|a(A:u  —  bv)  +  b(—lu  aw)(  -f  1)(|— b(ku  —  bv)  -f-  a(—lu  -f-  au)|  4-  1) 

=  (M  +  l)(|(a2  +  b2)v  —  ( bk  +  al)u|  +  l)  , 

which  has  the  form  (|u|  -f-  l)(|Mu  —  Cu\  +  I).  (Note  that  M  =  a2  62  is  the  number  of  memory 
chips.) 

In  order  to  obtain  S(u,  v),  we  first  determine  the  explicit  coefficients  M  and  C  in  B{u ,  v)  when 
the  components  of  the  basis  vectors  are  the  Fibonacci  numbers  a  =  F,  and  b  =  Fr  + 1-  We  use 
the  following  two  Fibonacci  identities: 

Fi+j  —  FjFt-  f-i  4*  F)  —  iF, , 

Ft+XF^  -  F]  =  1)‘. 

From  the  first  identity,  we  get  that  the  number  of  memories  M  is 

M  =  a2  +  b2 
=  F2r+F2r+  , 

=  F2r  +  l  • 

To  find  C,  observe  that  the  k  and  l  such  that  ak  —  bl  —  1  are  k  —  (— I)r+1Fr  and  l  — 
(— l)r+lFr_i.  Hence,  by  using  the  second  identity,  we  have  that 

C  =  bk  +  al 

=  (_1)'+i(Fr+iFr+FrFr_i) 

=  (-l  Y  +  'Fr+r 

=  (-iy+xF2r. 

Thus  for  a  —  Fr  and  b  =  Ff+i,  we  have 

B(u,  v)  =  (|u|  +  l)(|(-l)rF2ru  +  F2t+ i»|  +  1)  • 

The  form  S  was  defined  in  the  statement  of  the  lemma  as 

S{u,  v)  =  (|u|  +  l)(|F2rti  —  F2r+ivj  +  1) . 

If  r  is  odd,  then  ( — l)r  =  — 1,  and  therefore  B(u,v)  =  S(u,v).  If  r  is  even,  on  the  other  hand, 
then  B{u,—v)  =  S(u,v).  Since  wc  have  already  shown  that 

min  S(u,v)  =  min  Blu,  v) , 

(u, «,)*(<>, 0)  (v,v)^(0,0) 

we  get 

min  S(u,v)  ■—  min  S(u,v), 

(u,O)/(0,0)  (u,v)^(0,0) 


which  was  to  be  proved-B 


The  next  lemma  gives  the  exact  solution  for  MIN,  which  by  Lemma  3  is  the  minimum  value 
of  S(u,  v). 

Lemma  4.  Let  S(u,v)  =  (|u|  4  1)(|F2tu  —  F2r+ii/|  4  1)-  Then  min(Uit,)y£(0io)  S(u,  v)  = 
(Fr  4  l)(Fr+i  4  1). 

Proof.  We  first  show  that 

MIN  =  min  Slu,  v) 

(«,v)*(0.0) 

=  min  (|u|  4  l)(|F2ru  —  F2r+1t/|  4  1) 

(i»,u)^(0,0) 

=  min  (Fn  +  n+l  +  1)  i 

0<n<2r  +  l 

and  then  show  that  the  latter  minimum  is  ( Fr  4  l)(F,_n  1). 

It  suffices  to  consider  nonnegative  values  of  u  since  S(u,v)  =  S(— u, — u).  The  value  MIN 
cannot  exceed  S(0, 1)  =  F2r  +  i  4  1,  but  because  S(u,v)  >  u  4  1  (the  right  factor  is  at  least 
one),  we  need  only  seek  a  better  value  for  MIN  in  the  interval  0  <  u  <  F2f-f-i. 

The  key  idea  is  to  divide  the  half-open  interval  [l,F2(+i)  into  subintervals  [F„,Fn+i).  for 
n  =  2, 3, ...,2r.  (Notice  that  Fj  =  F2  =  I,  and  thus  n  starts  from  2.)  The  integer  u  lies 
inside  one  of  these  intervals.  Consider  the  fraction  F2r/F2r  +  j.  The  convcrgents  of  its  continued 
fraction  expansion  arc  Fi/F2,  F2/F3, . . . ,  F2r/F2r+ 1.  By  the  continued-fraction  approximation 
theorem  (4,  Theorem  181,  p.  151),  if  Fn  <  u  <  F„+1,  then  for  every  integer  v  we  have 


F2r  _  v  >  F2t 

F2T  +  l  u  ~  ^2r  +  l 


Fn- 1 
Fn 


Multiplying  through  on  both  sides  yields 


uF2r  —  vF2r  + 1  FirFn  —  Fn—lFir  +  l 

**^2r  +  l  “  F2r  +  lFn 

Using  the  Fibonacci  identity  |F,F,  —  F,+iFy_i|  =  F%—j+ i,  we  get 

|uF2r  —  vF2t-m|  >  7r|F2fFn  —  Fn-jFjr-pil 

**  n 

>  \FirFn  —Fn—  lFar-f-tl 
=  F2t — n+1  • 

To  summarize,  if  u  falls  in  the  interval  (Fn,Fn+i),  then  |uF2r  —  t>F2r+i|  >  F2r_n-f-i- 
Therefore, 

(u|  +  l)(|f*2ru  —  Fir-t-lV]  4  1)  >  {Fn  4  l)(F2r_n-i-l  4  1)  > 
and  equality  is  achieved  when  u  =  F„  and  v  =  Fn_ j.  As  a  result,  we  have 

,  7H",  „,(M  +  Od^arM  — Fa,+|t>|4  1) 


10 


which  completes  the  first  part  of  the  proof. 

The  second  part  of  the  proof  is  to  show  that  indeed 


min  (f.  +  IX*,  —  n+  1  +  !)  =  (^v  +  l)(Fr  +  l  +  1)  • 


0<n<2r-)-l 


If  we  define 


E(n,  r)  —  (Fn  -f  l)(Far— n+i  +  1)  > 


then  what  we  want  to  show  is 


„  min  E{n,r)  =  E{r,r). 

0<n<2r-f  1 


Since  E(n,r)  is  invariant  when  n  is  replaced  by  2r  — (-  1  —  n,  it  suffices  to  consider  values  of  n  in 
the  interval  [0,  rj. 

We  now  show  that  E(n,  r)  is  no  larger  than  E(n  +  1,  r)  for  n  =  1, . . ,  ,r  —  1,  after  which  we 
shall  complete  the  proof  by  demonstrating  that  E(0,r)  >  E(r,r).  We  make  use  of  the  explicit 
formula 

<j>r  —  6f 
Ft  =  *—2- 

\/5 

for  a  Fibonacci  number  in  terms  of  the  golden  ratio  and  its  conjugate  0  =  J(1  —  V$)  in  order 
to  obtain  an  alternative  expression  for  the  high-order  term  of  E[n,  r ): 


FnF2r+l— n  = 


fr+l-n 


**'+»  +  y  _ 


-«lw  5ar  +  !-n 


=  Cr  + 


NT 


^F2r_ 


3n+l  + 


2-2r-2B+P 


where  Cr  is  a  constant  depending  on  r  alone.  Taking  advantage  of  the  fact  that  |^|  is  less  than 
1  and  using  the  basic  recurrence  for  Fibonacci  numbers,  we  have 

E(n,r)-£(n  +  l,r) 


—  F2f4.i_n  +  Fn  + 


—  Fjr—n  —  Fn+ 1  — 


Vs  [ 

(~l)w+a 

Vs 


^F iT — 3n-)-l  + 

-^Far_2»_ 


>2r — 2n — 1 ' 


>F2f_n-l-Fn_,-^^-l 

VI 

>  F2f — n — i  —  F„_1  —  F2r — 2n 

>0, 

and  hence  E(n,  r)  is  at  least  as  large  as  E{r,  r)  for  n  ==  1, . . . ,  r  —  1. 

As  for  the  remaining  inequality  E(0,  r)  >  E(r,  r),  it  is  merely  FrFr+i-f  Fr+2  +  l  <  F2f+ i+l, 
and  its  truth  may  be  verified  by  using  the  identity  F*  -(-  F*+1  =  F2r+ i  J 


X 


Lemma  5.  The  minimum  size  of  a  rectilinearly  oriented  rectangle  that  contains  two  points  of 
the  lattice  generated  by  the  basis  vectors  (Fr,Fr±\)  and  (—Fr  +  i,  Fr)  is 


M1N  =  [F,  +  l)(Fr+,  +  l). 


Proof.  The  proof  follows  directly  from  Lemmas  3  and  4.| 


Theorem  6.  Let  M  —  F2t+\,  and  let  N  =  FrFr-\-\ -\-Fr  ^.2.  Then  there  is  an  organization  for 
raster-graphics  memory  with  M  memory  chips  such  that  every  rectilinearly  oriented  rectangle 
of  size  at  most  N  contains  pixels  from  distinct  memory  chips.  Furthermore,  N  is  greater  than 
MfsJ 5. 

Proof.  From  Lemma  5,  we  have  that  MIN  =  (Fr  -j-  l)(Ff_|_i  +  1),  and  since  N  —  MIN  —  1, 
we  get  N  =  FrFr+i  -f  Fr+2.  All  that  is  left  to  be  proved  is  that  N  >  M / \/Z .  Using  the  the 
explicit  formula  for  Fibonacci  numbers,  it  can  be  verified  that  the  sequence 


'  FrFr+ j  +  Fr+2  ' 


converges  to  l/\/5.  We  now  show  that  this  sequence  is  monotonically  decreasing,  so  each  of  its 
elements  is  at  least  as  large  as  the  l/\/o  limit,  which  will  complete  the  proof. 

It  is  enough  to  show  that  the  difference  of  consecutive  terms  in  the  sequence  is  positive,  or 
equivalently,  by  multiplying  through  that 


Fir+slKFr+l  +  Fr+2)  —  Fjf  +  l(Fr  +  iFr+2  +  Fr+3)  >  0. 


Using  the  explicit  formula  for  Fibonacci  numbers,  we  obtain  the  identity 


Fjr-f-3Fr  —  F2r  +  iFr+2  =  (—  l)r  +  1Fr  +  i , 


and  the  identity 


Fj,+3Ff-f  2  —  F2r4-iFf+3  =  Fir  +  lFf  +  F2rFr+ 2 


may  be  derived  by  induction. 

Multiplying  both  sides  of  the  first  identity  by  Fr+i  and  adding  it  to  the  second  yields 


F2f+3(FrFr+1  +  Fr+2)  —  F2r+i(Ff+iFr_)_2  +  Fr+3) 

=  F2r+1Fr  +  F2rFr+2  +  (-1  )r+lF^,  • 


The  right  hand  side  is  positive  because  Fr+l  is  less  than  both  F2t  and  Fr+2. | 

The  next  section  shows  that  the  Fibonacci  lattice  organization  is  essentially  the  best  possible. 


5.  Optimality  of  the  Fibonacci  lattice  organization 

This  section  shows  that  the  Fibonacci  lattice  organization  from  Section  3  is  essentially  the 
best  possible  by  providing  bounds  for  any  raster-graphics  memory  organization.  In  order  to  get 


"*  *  *  O  v'  O  O.  O  s.'  V  lO  O 


Wl 


the  bounds  for  the  memory  organization,  an  upper  bound  is  first  proved  in  the  continuous  model 
for  the  density  of  a  set  of  compatible  points.  In  particular,  we  show  that  any  set  5  of  compatible 
points  in  the  plane  has  density  d(S)  <  l/v/5. 

In  order  to  prove  the  the  density  bound  for  a  compatible  set  5,  we  consider  a  bounded  region 
of  S.  The  points  of  S  in  this  region  are  triangulated  in  such  a  way  that  most  of  the  triangles  have 
large  area  (at  least  s/l/2),  and  hence  the  density  of  S  in  this  region  is  small  (at  most  l/v/5  —  o(  1 )). 
Taking  the  limit  as  the  radius  of  the  region  tends  to  infinity  then  establishes  the  upper  bound  of 
1  /Vo.  First,  however,  we  introduce  some  terminology. 

Definition.  Let  P\  =  (ii.yi)  and  P3  =  (xj,y2)  be  two  points  in  the  plane.  We  denote 
the  (open)  rectilinearly  oriented  rectangle  defined  by  P\  and  P2  as  P(Pi,p2)  and  its  area  as 
A{PUP2)  =  |i!  —  X2II2/1  — 1/2 1*  We  also  denote  the  semiperimeter  of  the  rectangle  (the  l\ 
norm)  as  L(Pi,P2)  =  |*i  —  *2!  +  |yi  —  y3|- 
If  Pi  and  P2  are  compatible  points,  then  A(Pi,P3)  >  1.  Also,  we  have 

L(Pi,  P2)  =  |xi  —  z2|  4-  |yj  —  ya| 

>  2^/|n  —  X2(|iri  —  y2| 

>2, 

because  the  arithmetic  mean  is  at  least  the  geometric  mean. 

We  now  define  the  notion  of  good  and  bad  triangles,  and  show  that  a  good  triangle  has  area 
at  least  y/b/2. 

Definition.  Let  Pi,  P2,  and  P3  be  compatible  points  in  the  plane.  We  say  that  triangle 
APiP2P3  is  a  good  triangle  if 

•  P,  gP(P2,P3), 

•P2  €  P(Pi.^3),  and 
•P3tR(Pi,P2), 
and  a  bad  triangle  otherwise. 

Figure  5  gives  an  example  of  a  good  triangle  and  a  bad  triangle.  In  the  bad  triangle  of  the  figure, 
we  call  the  edge  P2P3  the  bad  edge,  and  we  call  the  angle  ZPjPiP3  the  bad  angle. 

The  next  lemma  provides  a  lower  bound  on  the  area  of  a  good  triangle. 

Lemma  7.  Any  good  triangle  has  area  at  least  y/i/2. 

Proof.1  Without  loss  of  generality,  we  assume  that  the  triangle  is  defined  by  the  three  points 
(0,0),  (xi,yi),  and  (x3,y3),  where  0  <  x3  <  xi  and  0  <  yi  <  y2,  because  any  good  triangle  can 
be  brought  to  this  position  by  translation  and  reflections  about  the  axes.  The  areas  A,  B,  and 
C  of  the  three  rectangles  defined  by  pairs  of  these  points  are  each  constrained  to  be  at  least  one 
since  the  points  arc  compatible,  and  hence 

A  =  *iyi  >  1 , 


B  =  Xjt/2  >  1 , 


'Thanks  to  Don  Coppersmith  of  HIM  who  provided  this  proof  which  is  simpler  than  our  original. 


Figure  S.  A  good  triangle  (top)  and  a  bad  triangle  (bottom). 


C  =  (z i  —  z2)(y2  —  yi)  >  1  - 

The  area  of  the  triangle  is  $(x iy2  -  x2yi)  which  we  wish  to  show  is  at  least  y/5/2. 
Substituting  yi  =  A/z  1  and  x2  —  B/ya  iuto  this  last  inequality  yields 

C  =  (x  1  —  B/y2)(y 2  —  A/xi) . 

Multiplying  through  by  Ziy2  gives 

(*ty2)J  —  (A  +  B  +  C)zty2  +  AB  =  0  . 

Similarly,  substituting  zj  =  A/yi  and  y2  =  B/z2  into  the  third  equation  gives 

(z2yi)2  —  (A  -f  B  -+•  C)x2'ji  +  AD  =  0  . 

Thus  both  zjy2  and  x2yi  are  roots  of  the  equation  s2  —  (A  -|-  B  +  C)s  4-  AD  =  0,  and  since 
*iya  >  z2y(,  we  have 


1 

*1^2  =  „ 


=  l (t  +  //  +  ('  -  /(.i  +  //  +  ( ;j-  -  tit//] . 


Wo  ran  now  hound  tin*  area  of  tin-  triangle  hy  si mpl<*  algebraic  manipulation: 

5  ( -r  i  </J  ~  J -HI  i )  =  y/{A  +  H  +  Cf^TAH 

it  it 

=  72  yf[A  '•  T  iaF+wc 

>  Vo +1  +  2  +  2 

=  ^•1 

Although  the  area  of  a  good  triangle  is  hounded  from  below,  the  area  or  a  had  triangle  can 
he  arbitrarily  small.  In  order  to  get  a  good  upper  hound  on  the  density  of  a  compatible  set,  we 
now  show  that  any  disk-shaped  portion  of  such  a  set  has  a  triangulation  with  Tew  bad  triangles. 

Lemma  8.  Let  S  be  a  set  of  compatible  points  in  the  plane,  and  let  Dr  be  a  disk  of  radius 
t.  Then  there  is  a  triangulation  of  the  points  in  S  P|  Dr  which  contains  no  more  than  \r  bad 
triangles. 

Proof.  Consider  triangulations  of  the  points  in  S  f)  Dr  that  minimize  the  number  of  bad  triangles, 
and  of  those  pick  one  that  minimizes  the  sum  of  the  bad  angles  of  the  bad  triangles.  Take  any 
bad  triangle  AP1P2P3  with  bad  edge  Pi If  the  bad  edge  T^jPj  does  not,  lie  on  the  boundary  of 
the  convex  hull  of  Sf|Pr,  ^icn  there  is  a  fourth  point  l\  €  Sf|  A-  such  that  triangle  A 
shares  the  edge  P2P3.  The  point  l\  is  lying  in  the  hair  plane  defined  by  P-i  1*3  which  docs  not 
contain  l\. 

Figure  6  shows  a  bad  triangle  in  the  triangulation,  which  is  the  general  case  except  for 
reflections  about  the  axes.  The  figure  also  outlines  the  live  possible  regions  in  which  P 4  could 
lie.  We  shall  show  by  case  analysis  that  in  a  “minimal”  triangulation,  the  only  feasible  region  for 
Pi  is  Region  5.  If  1\  is  in  any  of  the  other  regions,  replacing  edge  P2P3  wiLh  edge  P\l\  would 
improve  the  triangulation  thus  contradicting  its  minimality. 

Case  1.  If  P\  is  in  Region  1,  we  replace  the  two  bad  triangles  AP1P2P3  and  A P^P-sPa  by  the 
two  good  triangles  Af\  P2P4  and  AP|  P3P4. 

Case  2.  If  Pi  is  in  Region  2,  we  replace  the  bad  triangle  A P1P2P3  and  the  good  triangle  A P2P3P4 
by  two  good  triangles  AP| P2P4  and  A P1P3P4. 

Case  S.  If  P\  is  in  Region  3,  we  replace  the  bad  triangles  A P1P2P3  and  A P2P3P4  by  the  bad 
triangles  A P1P2P4  and  A P1P3P4.  Although  this  modification  does  not  reduce  the  number  of 
bad  triangles,  it  docs  reduce  the  sum  of  the  bad  angles.  We  assume  without  loss  of  generality 
that  Pi  is  in  the  upper  right  portion  of  Region  3.  The  two  bad  angles  were  originally  ZP2P1P3 
and  ZP2P4P3,  and  they  are  replaced  by  the  bad  angles  ZP2P1P4  and  ZP2P4P3.  This  is  an 
improvement  since  ZP2P1P4  <  ZP2P1P3  and  ZP1P4P3  <  ZP2P4P3. 

Case  4.  If  Pi  is  in  the  upper  right  portion  of  Region  4,  we  replace  the  bad  triangle  AP1P2P3  and 
the  good  triangle  AP2P3P4  by  the  bad  triangle  AP1P2P4  and  the  good  triangle  AP1P3P4.  The 
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Figure  6.  A  bad  triangle  whose  bad  edge  /’j  / a  is  not  on  the  boundary,  and  the  five 
regions  ill  which  point  /’,  could  lie. 


new  bad  angle  £P>  P\  Pi  is  smaller  than  the  original  bad  angle  Z/VW  The  lower  left  portion 
is  dealt  with  similarly. 

Thus  we  may  conclude  that  the  point  Pi  is  in  Region  5,  and  without  loss  of  generality 
we  assume  it  is  in  the  upper  right  portion  of  the  region.  The  triangle  A /W**  ■*  bad,  and 
—  IAP‘i>  ^l)  +  !\Pi,Pi)-  Since  the  points  /'3  and  l\  arc  compatible,  we  have  already 
shown  that  l\PiJ\)  >  2,  and  hence  l{P‘i,Pi)  >  l\P-i,  Pj)  +  2. 

Applying  the  same  arguments  to  the  triangle  A P^P^Pi,  we  obtain  a  chain  of  adjacent  bad 
triangles  with  increasing  bad-edge  lengths  in  the  /,  norm.  The  chain  cannot  cycle  back  on  itself 
because  the  edge  lengths  are  strictly  increasing.  Thus  the  chain  must  terminate  with  a  bad 
edge  PxPj  on  the  boundary  of  the  convex  hull  of  .V f)  Dr.  In  fact,  the  bad  edge  l\P}  can  be  the 
terminating  edge  of  more  than  one  chain  because  there  can  be  a  tree' of  bad  triangles  rooted  at 
the  triangle  with  bad  edge  /\ P3  on  the  boundary  of  the  convex  hull  of  Sf|  Pf . 

If  the  tree  contains  k  triangles,  then  the  boundary  edge  l\l\  has  length  lJ(Pi,Pj)  >  2k  +  2, 
which  we  now  show  by  induction  on  k.  For  k  =  0,  the  length  bound  holds  Tor  any  two  compatible 
points.  Let  Jb|  be  the  number  or  bad  triangles  in  the  subtree  converging  to  l\Plt  and  let  Ibj  be 
the  number  converging  to  l\P}.  (The  values  k\  and  *2  may  be  zero.)  Then  Jb  =e  *,  +  Jfc2  +  l,  and 
hence  the  induction  hypothesis  holds  for  both  subtrees  because  *|  and  kj  arc  each  less  than  k. 
Therefore, 

L(/\,r,)  =  L(pt,pl)  +  /^*t,pJ) 

>  (2*,  +  2)  +  (2*2  +  2) 

=  2(*i  +*2  +  I) +  2 
-  2*  +  2 , 

as  desired. 
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Figure  7.  A  tree  of  four  bad  triangles  which  terminate  with  edge  /’,/>• 


We  have  just  shown  that  ir  the  tree  has  fc  triangles,  then  the  boundary  edge  /*,/’,  has  length 
l\Px,Pj)  >  2fc  +  2,  and  hence  *  <  J L[Pi,Pj ).  Furthermore,  since  the  trees  rooted  at  two  bad 
boundary  edges  consist  or  disjoint  sets  of  bad  triangles,  we  can  bound  the  total  number  of  bad 
triangles  in  the  whole  triangulation  in  terms  of  the  length  (in  the  L  norm)  of  the  boundary  of 
the  convex  hull  of  $(")£,.  But  the  length  of  the  boundary  is  at  most  8 r,  and  therefore,  the  total 
number  of  bad  triangles  is  less  than  8r/2  ***  4r.| 

Lemma  8  shows  that  a  set  of  compatible  points  in  a  disk  can  be  triangulated  with  Tew  bad 
triangles.  Therefore,  most  of  the  triangles  arc  good  triangles  which,  by  lamina  7,  have  large 
area.  These  results  allow  us  to  give  a  l/>/5  upper  bound  on  the  density  of  compatible  sets. 

Theorem  9.  Let  S  be  a  »et  of  compatible  point »  m  the  plane.  Then  the  density  d(S)  of  S 
satisfies  d(S)  <  l/>/5. 

Proof.  Let  VT  be  a  disk  of  radius  r,  let  n  be  the  number  of  points  in  5f)Pn  and  suppose  the 
boundary  of  the  convex  hull  of  5  f)  Dr  contains  m  points.  By  Lemma  8  there  exists  a  triangulation 
of  the  n  points  with  at  most  4r  bad  triangles.  F/very  triangulation  of  the  m  points  contains 
2n  -  rri  —  2  triangles,  and  thus  the  number  of  good  triangles  is  at  least  2n  -  rn  -  2  —  4r.  A  lower 
bound  on  the  length  in  the  L  norm  of  the  boundary  of  the  convex  hull  of  SflPr  is  2 m,  and  an 
upper  bound  is  8r.  Hence  m  <  4r,  and  the  number  of  good  triangles  is  at  least  2n  —  8r  —  2. 

By  Lemma  7  the  area  of  a  good  triangle  is  at  least  y/b/2,  and  thus  the  total  area  occupied 
by  the  good  triangles  is  at  least  (2n  —  8r  -  2)(y/b/2)  =  (n  —  4r  —  l)\/5.  The  area  of  the  good 
triangles  cannot  exceed  the  circle  area,  so  (n  -  4r  -  l)\/5  <  nr2  and  n  <  n -f  4r  +  1 .  The 
density  of  points  within  Sf)^r  is  just 

n  1  4r  +  1 

*f2  y/l 


Letting  r  -»  oo  implies  d(S)  <  1  /n/5,  as  desired. | 
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We  arc  now  prepared  to  show  that  the  Fibonacci  lattice  organization  from  Section  3  is 
essentially  the  best  possible. 

Theorem  10.  For  any  organization  of  raster  graphics  memory  with  M  memory  chips  such  that 
every  rectilinearly  oriented  rectangle  of  size  N  contains  no  two  pixels  from  the  same  memory 
chip,  the  relation  M  >  \/5N  —  0[N3^*)  holds. 

Proof  sketch.  The  proof  parallels  that  of  Theorem  9.  The  principal  difference  is  that  the  size  of 
a  rectangle  includes  not  only  its  area,  but  also  half  its  perimeter  plus  1. 

Let  5  be  a  set  of  grid  points  in  the  plane  such  that  for  every  pair  of  points  Pj  =  (ii,yi),  P2  = 
(x2,  1/2)  >n  it,  the  compatibility  condition  (|ii  —  x2|  +  l)(|j/i  —  2/2 i  +  1)  >  N  holds.  Multiplying 
through  yields  |zi  —  x2 ||yi  —  y2|  >  iV  —  |xj  —  ar2|  —  |yt  —  y2|  —  1.  If  Pi  and  P2  are  two 
points  contained  within  a  circle  of  radius  r,  then  we  have  |ii  —  z2|  -f  |yt  —  y2|  <  2 \/2r,  so 
|xi  —  x2||yi  —  y2|  >  N  —  2\/2 r  —  1. 

Letting  c  =  N  —  2 y/2r  —  1  and  using  the  notation  of  Section  2,  we  have  A(Pi,P2)  >  c. 
Using  the  same  techniques  as  in  the  proof  of  Lemma  7,  Lemma  8,  and  Theorem  9,  we  can  prove 
that  the  density  dr  of  points  in  SQ  Dr  satisfies  cdT  <  >  w^ere  r'  —  rlV^-  Letting 

r  =  N3/*,  we  have  r'  >  N1^*,  and  hence 

(N  -  2>/2N3'*  -  1  )dr  <  —  +  • 

y/S  ^1/2 

Since  dT  is  an  upper  bound  on  the  density  of  pixels  which  can  be  stored  on  a  single  memory  chip, 
we  have  M  >  l/dr>  and  thus 


M  >  v/5 N 


(l  — 2>/5Af“|/4  — l/jv) 


=  VSN  -  0(N3/ 4)  .| 


I.  Addressing  scheme 

The  organisation  for  raster-graphics  memory  proposed  in  Section  3  guarantees  that  small  rec¬ 
tangles  contain  pixels  from  distinct  memory  chips.  In  order  for  the  entire  system  performance  to 
benefit  from  this  organization,  however,  the  address  calculations  must  be  easily  implemented.  We 
do  not  try  to  solve  all  the  engineering  problems  associated  with  making  this  memory  organization 
scheme  work,  but  in  this  section  we  give  indications  of  how  the  address  calculations  might  be 
efficiently  computed. 

The  addressing  mechanism  must  be  able  to  take  the  x-  and  y-coordinates  of  a  pixel  and 
generate  the  chip  number  and  address  within  the  chip.  Suppose  the  lattice  organization  is 
determined  by  two  basis  vectors  (a,b)  and  (—b,a).  Two  pixels  at  locations  (x0,yo)  and  (x,y) 
which  differ  by  an  integral  linear  combination  of  the  the  basis  vectors  lie  in  the  same  memory 
chip.  That  is,  they  have  the  same  memory  number  if  there  exist  (unique)  integers  U  and  V  such 
that 

(*.  y)  -  (tq,  yo)  =  U{ a,  6)  4-  V[-b,  a) . 
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One  natural,  but  inefficient,  addressing  mechanism  is  based  on  the  fact  that  each  of  the 
M  =  a2  b2  memory  chips  contains  exactly  one  representative  in  the  basic  region  with  corners 
(0,0),  (a,  6),  (—6,  a)  and  (a  —  b,a  - f-  b).  The  chip  number  of  a  pixel  (x,  y)  can  be  determined 
by  computing  which  pixel  (xo,Jio)  in  the  basic  region  is  from  the  same  chip,  and  then  using  the 
ordered  pair  (z0,j/o)  as  the  chip  number.  By  letting 


the  chip  number  (xo,J/o)  of  a  pixel  (x,  y)  is  then  (xo,yo)  =  (x,  y)  —  U(a,b)  —  V(— b,  a).  Fur¬ 
thermore,  the  ordered  pair  (U ,  V)  forms  an  appropriate  address  for  the  pixel  (x,  y)  within  the 
chip. 

The  addressing  mechanism  can  be  simplified  substantially  if  we  notice  that  any  arbitrary  set 
of  M  pixels,  no  two  of  which  are  from  the  same  chip,  can  be  used  as  a  set  of  representatives.  In 
particular,  any  pixel  differs  by  an  integral  linear  combination  of  the  basis  vectors  from  a  unique 
pixel  in  the  horizontal  line  extending  from  (0,0)  to  (0,  M  —  1).  This  scheme  corresponds  to 
tiling  the  plane  with  one-by-A/  bricks  instead  of  tilted  squares.  (Hoiladay  [5]  uses  a  similar  tiling 
scheme  for  halftone  generation.) 

To  derive  an  appropriate  addressing  scheme,  we  choose  an  alternative  pair  of  basis  vectors 
that  span  the  same  lattice.  Since  a  and  b  are  relatively  prime,  there  exist  integers  k  and  l  such 
that  ak  —  bl  =  1.  The  two  vectors  (bk  +  al,  1)  and  (a2  -f-  62, 0)  generate  the  same  lattice  as  the 
original  basis  vectors  (a,  6),  (—6,  a).  Thus  any  pixel  (x,  y)  can  be  mapped  to  a  pixel  (xo,J/o)  where 
2/o  =  0  and  x0  €  [0,  M),  which  means  xo  alone  can  serve  as  the  chip  number  for  the  pixel.  If  we 
denote  C  =  bk  al,  and  recalling  that  M  =  a2  -f  62,  the  chip  number  for  an  arbitrary  pixel 
(x,  y)  is  x  — ■  Cy  (mod  A/).  The  address  of  the  pixel  the  ordered  pair  {[x/M J,y),  which  is  also 
easy  to  compute. 

An  advantage  of  any  doubly  periodic  organization  that  should  be  mentioned  concerns  the 
communication  among  the  memory  chips.  Typically,  each  chip  has  a  single  connection  to  an 
Af-pixel  buffer.  To  move  a  rectangle  of  pixels,  three  steps  are  required.  The  rectangle  of  pixels 
is  read  into  the  buffer,  the  pixels  in  the  buffer  are  permuted,  and  the  pixels  are  written  back  to 
the  memory  chips  at  different  locations.  The  advantage  of  the  periodic  organization  is  that  the 
set  of  permutations  encompasses  only  circular  shifts  of  the  buffer.  Thus  a  standard  barrel  shifter 
can  be  used  for  all  permutations. 

One  issue  that  we  have  not  faced  is  the  problem  of  generating  addresses  for  each  of  the  M 
chips  given  some  standard  specification  of  the  rectangle  to  be  accessed.  Whether  the  address 
calculations  can  be  made  possible  at  reasonable  cost  requires  an  engineering  analysis.  The  com¬ 
peting  concerns  are  the  the  strong  regularity  of  the  lattice-based  organization  which  should  help 
the  design  versus  ie  need  to  perform  modular  arithmetic  which  could  require  much  hardware. 

7.  Comments 

The  Fibonacci  lattice  organization  of  memory  allows  all  rcctilinearly  oriented  rectangles  of 
a  given  size  to  be  accessed.  Not  surprisingly,  some  economy  in  hardware  can  be  gained  by 
being  more  restrictive.  For  example,  the  memory  organization  based  on  the  lattice  generated  by 
the  basis  vectors  (l,s)  and  ( —  s,  1)  allows  three  types  of  rectangles— s-by-s,  one- by- a2  +  1,  and 


s2  ■+■  1-by-onc — to  be  accessed  efficiently.  The  number  of  memories  required  by  this  scheme  is 
M  =  s2  +  t. 

The  Fibonacci  lattice  organization  can  also  be  used  to  speed  up  the  access  rate  in  machines 
with  interleaved  memories.  For  example,  the  organization  might  be  useful  for  matrix  and  image 
processing  applications. 

An  interesting  question  is  how  to  extend  the  constructions  and  bounds  of  this  paper  to 
dimensions  higher  than  two.  For  example,  the  analogous  question  for  three  dimensions  would 
be,  “How  does  one  construct  a  dense  set  of  points  in  the  three  dimensional  space,  such  that  no 
two  points  of  the  set  are  contained  in  the  interior  of  a  rectilinearly  oriented  box  of  volume  1  ?” 
and,  if  such  construction  is  at  all  possible,  "What  is  the  maximum  density  possible  for  such  a 
set?"  In  fact,  we  can  construct  a  lattice  whose  density  is  1/7,  satisfying  the  “compatability” 
requirements.  Using  a  tilted  cube  of  edge  length  s/3,  we  can  show  (by  an  argument  similar  to 
that  in  [1],  Theorem  1)  that  1  /3>/3  is  an  upper  bound  on  the  density  of  such  sets.  It  remains  to 
be  seen  whether  the  techniques  from  Section  2  can  be  applied  to  achieve  a  tighter  lower  bound 
for  the  three-dimensional  case. 
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